place class
Continual Multi-Robot Learning from Black-Box Visual Place Recognition Models
Tsukahara, Kenta, Tanaka, Kanji, Iwata, Daiki, Liang, Jonathan Tay Yu
In the context of visual place recognition (VPR), continual learning (CL) techniques offer significant potential for avoiding catastrophic forgetting when learning new places. However, existing CL methods often focus on knowledge transfer from a known model to a new one, overlooking the existence of unknown black-box models. We explore a novel multi-robot CL approach that enables knowledge transfer from black-box VPR models (teachers), such as those of local robots encountered by traveler robots (students) in unknown environments. Specifically, we introduce Membership Inference Attack, or MIA, the only major privacy attack applicable to black-box models, and leverage it to reconstruct pseudo training sets, which serve as the key knowledge to be exchanged between robots, from black-box VPR models. Furthermore, we aim to overcome the inherently low sampling efficiency of MIA by leveraging insights on place class prediction distribution and un-learned class detection imported from the VPR literature as a prior distribution. We also analyze both the individual effects of these methods and their combined impact. Experimental results demonstrate that our black-box MIA (BB-MIA) approach is remarkably powerful despite its simplicity, significantly enhancing the VPR capability of lower-performing robots through brief communication with other robots. This study contributes to optimizing knowledge sharing between robots in VPR and enhancing autonomy in open-world environments with multi-robot systems that are fault-tolerant and scalable.
Recursive Distillation for Open-Set Distributed Robot Localization
Tsukahara, Kenta, Tanaka, Kanji
A typical assumption in state-of-the-art self-localization models is that an annotated training dataset is available for the target workspace. However, this is not necessarily true when a robot travels around the general open world. This work introduces a novel training scheme for open-world distributed robot systems. In our scheme, a robot (``student") can ask the other robots it meets at unfamiliar places (``teachers") for guidance. Specifically, a pseudo-training dataset is reconstructed from the teacher model and then used for continual learning of the student model under domain, class, and vocabulary incremental setup. Unlike typical knowledge transfer schemes, our scheme introduces only minimal assumptions on the teacher model, so that it can handle various types of open-set teachers, including those uncooperative, untrainable (e.g., image retrieval engines), or black-box teachers (i.e., data privacy). In this paper, we investigate a ranking function as an instance of such generic models, using a challenging data-free recursive distillation scenario, where a student once trained can recursively join the next-generation open teacher set.
Cross-view Self-localization from Synthesized Scene-graphs
Yamamoto, Ryogo, Tanaka, Kanji
Cross-view self-localization is a challenging scenario of visual place recognition in which database images are provided from sparse viewpoints. Recently, an approach for synthesizing database images from unseen viewpoints using NeRF (Neural Radiance Fields) technology has emerged with impressive performance. However, synthesized images provided by these techniques are often of lower quality than the original images, and furthermore they significantly increase the storage cost of the database. In this study, we explore a new hybrid scene model that combines the advantages of view-invariant appearance features computed from raw images and view-dependent spatial-semantic features computed from synthesized images. These two types of features are then fused into scene graphs, and compressively learned and recognized by a graph neural network. The effectiveness of the proposed method was verified using a novel cross-view self-localization dataset with many unseen views generated using a photorealistic Habitat simulator.
Compressive Self-localization Using Relative Attribute Embedding
Yamamoto, Ryogo, Tanaka, Kanji
Abstract-- The use of relative attribute (e.g., beautiful, safe, convenient) -based image embeddings in visual place recognition, as a domain-adaptive compact image descriptor that is orthogonal to the typical approach of absolute attribute (e.g., color, shape, texture) -based image embeddings, is explored in this paper. Most current state-of-the-art visual place recognition (VPR) algorithms employ absolute attribute (e.g., color, shape, texture) -based image embedding for image feature with a boundary condition description [1]-[3] and image similarity search [4]. In this study, we are interested in relative attributes (e.g., beautiful, The objective is to search for the image most relevant to a given query image over an image database. B. Descriptor Similarity The database is constructed as a collection of viewpointannotated Next, descriptor similarity is evaluated between the input view images from visual experiences in the training descriptor R and each database descriptor R Specifically, the procedure for construction consists of two The first method, called binary relative strength (BRS), treats steps (Figure 1): (1) extracting a feature descriptor from the the descriptor as a binary relative attribute (stronger or image, and (2) evaluating the descriptor similarity between weaker), and evaluates the similarity by the query and each database images. Either step is detailed in the following.